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<110> Genencor International, Inc. 
Dunn-Coleman, N. 
Shetty, J. 
Duan, G. 
Sung, A. 
Qian, Y. 

<120> A Method for the Preparation of a High Purity Rice Protein 
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<130> GC830-PCT 

<140> 10587006 
<141> 2007-12-21 

<150> PCT/US05/05320 
<151> 2005-02-17 

<150> US 60/547,153 
<151> 2004-02-23 

<160> 11 

<170> Patentln version 3.2 

<210> 1 
<211> 2103 
<212> DNA 

<213> Humicola grisea var . thermoidea 
<400> 1 

atgcatacct tctccaagct cctcgtcctg ggctctgccg tccagtctgc cctcgggcgg 60 

cctcacggct cttcgcgtct ccaggaacgc gctgccgttg ataccttcat caacaccgag 120 

aagcccatcg catggaacaa gctgctcgcc aacatcggcc ctaacggcaa agccgctccc 180 

ggtgccgccg ccggcgttgt gattgccagc ccttccagga cggaccctcc ttgtacgtgg 240 

tggcatggaa tggacccaag agactggttt tagatgaaag agagtttctg ctaaccgcca 30 0 

cacccagact tcttcacctg gacccgcgat gccgccctgg tcctcaccgg catcatcgag 360 

tcccttggcc acaactacaa caccaccctg cagaccgtca tccagaacta cgtcgcgtcg 420 

caggccaagc tgcagcaggt ctcgaacccc tcgggaacct tcgccgacgg ctcgggtctc 480 

ggtgaggcca agttcaatgt cgacctcact gccttcactg gcgaatgggg tcgccctcag 540 

agggacggcc cgcccctgcg cgccatcgct ctcatccagt acgccaagtg gctgatcgcc 600 

aacggctaca agagcacggc caagagcgtc gtctggcccg tcgtcaagaa cgatctcgcc 660 

tacacggccc agtactggaa cgagaccggc ttcgatctct gggaggaggt ccccggcagc 720 

tcgttcttta ccatcgccag ctctcacagg ggtgagtcat ttattgttca gtgttttctc 780 

attgaataat taccggaatg ccactgacgc caaacagctc tgactgaggg tgcttacctc 840 

gccgctcagc tcgacaccga gtgccgcgcc tgcacgaccg tcgcccctca ggttctgtgc 900 

ttccagcagg ccttctggaa ctccaagggc aactatgtcg tctccaacag taagatccct 960 

acaccaacaa aaaaaatcga aaaggaacgt tagctgaccc ttctagtcaa cggcggcgag 102 0 

tatcgctccg gcaaggacgc caactcgatc ctggcgtcca tccacaactt cgaccctgag 1080 

gccggctgcg acaacctgac cttccagccc tgcagcgagc gcgccctggc caaccacaag 1140 

gcctatgtcg actcgttccg caacctctac gccatcaaca agggcatcgc ccagggcaag 1200 

gccgttgccg tcggccgcta ctcggaggat gtctactaca acggcaaccc gtggtacctg 1260 

gccaactttg ccgccgccga gcagctctac gacgccatct acgtgtggaa caagcagggc 1320 

tccatcaccg tgacctcggt ctccctgccc ttcttccgcg accttgtctc gtcggtcagc 1380 

accggcacct actccaagag cagctcgacc ttcaccaaca tcgtcaacgc cgtcaaggcc 1440 



tacgccgacg gcttcatcga ggtggcggcc aagtacaccc cgtccaacgg cgcgctcgcc 1500 

gagcagtacg accgcaacac gggcaagccc gactcggccg ccgacctgac gtggtcgtac 1560 

tcggccttcc tctcggccat cgaccgccgc gcgggtctcg tccccccgag ctggcgggcc 1620 

agcgtggcca agagccagct gccgtccacc tgctcgcgca tcgaggtcgc cggcacctac 1680 

gtcgccgcca cgagcacctc gttcccgtcc aagcagaccc cgaacccctc cgcggcgccc 1740 

tccccgtccc cctacccgac cgcctgcgcg gacgctagcg aggtgtacgt caccttcaac 1800 

gagcgcgtgt cgaccgcgtg gggcgagacc atcaaggtgg tgggcaacgt gccggcgctg 1860 

gggaactggg acacgtccaa ggcggtgacc ctgtcggcca gcgggtacaa gtcgaatgat 1920 

cccctctgga gcatcacggt gcccatcaag gcgacgggct cggccgtgca gtacaagtat 1980 

atcaaggtcg gcaccaacgg gaagattact tgggagtcgg accccaacag gagcattacc 2040 

ctgcagacgg cgtcgtctgc gggcaagtgc gccgcgcaga cggtgaatga ttcgtggcgt 2100 

taa 2103 

<210> 2 
<211> 634 
<212> PRT 

<213> Humicola grisea var . thermoidea 
<400> 2 

Met His Thr Phe Ser Lys Leu Leu Val Leu Gly Ser Ala Val Gin Ser 

15 10 15 

Ala Leu Gly Arg Pro His Gly Ser Ser Arg Leu Gin Glu Arg Ala Ala 

20 25 30 

Val Asp Thr Phe lie Asn Thr Glu Lys Pro lie Ala Trp Asn Lys Leu 

35 40 45 

Leu Ala Asn lie Gly Pro Asn Gly Lys Ala Ala Pro Gly Ala Ala Ala 

50 55 60 

Gly Val Val lie Ala Ser Pro Ser Arg Thr Asp Pro Pro Tyr Phe Phe 
65 70 75 80 

Thr Trp Thr Arg Asp Ala Ala Leu Val Leu Thr Gly lie lie Glu Ser 

85 90 95 

Leu Gly His Asn Tyr Asn Thr Thr Leu Gin Thr Val lie Gin Asn Tyr 

100 105 110 

Val Ala Ser Gin Ala Lys Leu Gin Gin Val Ser Asn Pro Ser Gly Thr 

115 120 125 

Phe Ala Asp Gly Ser Gly Leu Gly Glu Ala Lys Phe Asn Val Asp Leu 

130 135 140 

Thr Ala Phe Thr Gly Glu Trp Gly Arg Pro Gin Arg Asp Gly Pro Pro 
145 150 155 160 

Leu Arg Ala lie Ala Leu lie Gin Tyr Ala Lys Trp Leu lie Ala Asn 

165 170 175 

Gly Tyr Lys Ser Thr Ala Lys Ser Val Val Trp Pro Val Val Lys Asn 

180 185 190 

Asp Leu Ala Tyr Thr Ala Gin Tyr Trp Asn Glu Thr Gly Phe Asp Leu 

195 200 205 

Trp Glu Glu Val Pro Gly Ser Ser Phe Phe Thr lie Ala Ser Ser His 

210 215 220 

Arg Ala Leu Thr Glu Gly Ala Tyr Leu Ala Ala Gin Leu Asp Thr Glu 
225 230 235 240 

Cys Arg Ala Cys Thr Thr Val Ala Pro Gin Val Leu Cys Phe Gin Gin 

245 250 255 

Ala Phe Trp Asn Ser Lys Gly Asn Tyr Val Val Ser Asn lie Asn Gly 

260 265 270 

Gly Glu Tyr Arg Ser Gly Lys Asp Ala Asn Ser lie Leu Ala Ser lie 

275 280 285 

His Asn Phe Asp Pro Glu Ala Gly Cys Asp Asn Leu Thr Phe Gin Pro 
290 295 300 



Cys Ser Glu Arg Ala Leu Ala Asn His Lys Ala Tyr Val Asp Ser Phe 
305 310 315 320 

Arg Asn Leu Tyr Ala lie Asn Lys Gly lie Ala Gin Gly Lys Ala Val 

325 330 335 

Ala Val Gly Arg Tyr Ser Glu Asp Val Tyr Tyr Asn Gly Asn Pro Trp 

340 345 350 

Tyr Leu Ala Asn Phe Ala Ala Ala Glu Gin Leu Tyr Asp Ala lie Tyr 

355 360 365 

Val Trp Asn Lys Gin Gly Ser lie Thr Val Thr Ser Val Ser Leu Pro 

370 375 380 

Phe Phe Arg Asp Leu Val Ser Ser Val Ser Thr Gly Thr Tyr Ser Lys 
385 390 395 400 

Ser Ser Ser Thr Phe Thr Asn lie Val Asn Ala Val Lys Ala Tyr Ala 

405 410 415 

Asp Gly Phe lie Glu Val Ala Ala Lys Tyr Thr Pro Ser Asn Gly Ala 

420 425 430 

Leu Ala Glu Gin Tyr Asp Arg Asn Thr Gly Lys Pro Asp Ser Ala Ala 

435 440 445 

Asp Leu Thr Trp Ser Tyr Ser Ala Phe Leu Ser Ala lie Asp Arg Arg 

450 455 460 

Ala Gly Leu Val Pro Pro Ser Trp Arg Ala Ser Val Ala Lys Ser Gin 
465 470 475 480 

Leu Pro Ser Thr Cys Ser Arg lie Glu Val Ala Gly Thr Tyr Val Ala 

485 490 495 

Ala Thr Ser Thr Ser Phe Pro Ser Lys Gin Thr Pro Asn Pro Ser Ala 

500 505 510 

Ala Pro Ser Pro Ser Pro Tyr Pro Thr Ala Cys Ala Asp Ala Ser Glu 

515 520 525 

Val Tyr Val Thr Phe Asn Glu Arg Val Ser Thr Ala Trp Gly Glu Thr 

530 535 540 

lie Lys Val Val Gly Asn Val Pro Ala Leu Gly Asn Trp Asp Thr Ser 
545 550 555 560 

Lys Ala Val Thr Leu Ser Ala Ser Gly Tyr Lys Ser Asn Asp Pro Leu 

565 570 575 

Trp Ser lie Thr Val Pro lie Lys Ala Thr Gly Ser Ala Val Gin Tyr 

580 585 590 

Lys Tyr lie Lys Val Gly Thr Asn Gly Lys lie Thr Trp Glu Ser Asp 

595 600 605 

Pro Asn Arg Ser lie Thr Leu Gin Thr Ala Ser Ser Ala Gly Lys Cys 

610 615 620 

Ala Ala Gin Thr Val Asn Asp Ser Trp Arg 
625 630 



<210> 3 

<211> 604 

<212> PRT 

<213> Humicola grisea var . thermoidea 



<400> 3 

Ala Ala Val Asp Thr Phe lie Asn Thr Glu Lys Pro lie Ala Trp Asn 

15 10 15 

Lys Leu Leu Ala Asn lie Gly Pro Asn Gly Lys Ala Ala Pro Gly Ala 

20 25 30 

Ala Ala Gly Val Val lie Ala Ser Pro Ser Arg Thr Asp Pro Pro Tyr 

35 40 45 

Phe Phe Thr Trp Thr Arg Asp Ala Ala Leu Val Leu Thr Gly lie lie 
50 55 60 



Glu Ser Leu Gly His Asn Tyr Asn Thr Thr Leu Gin Thr Val lie Gin 
65 70 75 80 

Asn Tyr Val Ala Ser Gin Ala Lys Leu Gin Gin Val Ser Asn Pro Ser 

85 90 95 

Gly Thr Phe Ala Asp Gly Ser Gly Leu Gly Glu Ala Lys Phe Asn Val 

100 105 110 

Asp Leu Thr Ala Phe Thr Gly Glu Trp Gly Arg Pro Gin Arg Asp Gly 

115 120 125 

Pro Pro Leu Arg Ala lie Ala Leu lie Gin Tyr Ala Lys Trp Leu lie 

130 135 140 

Ala Asn Gly Tyr Lys Ser Thr Ala Lys Ser Val Val Trp Pro Val Val 
145 150 155 160 

Lys Asn Asp Leu Ala Tyr Thr Ala Gin Tyr Trp Asn Glu Thr Gly Phe 

165 170 175 

Asp Leu Trp Glu Glu Val Pro Gly Ser Ser Phe Phe Thr lie Ala Ser 

180 185 190 

Ser His Arg Ala Leu Thr Glu Gly Ala Tyr Leu Ala Ala Gin Leu Asp 

195 200 205 

Thr Glu Cys Arg Ala Cys Thr Thr Val Ala Pro Gin Val Leu Cys Phe 

210 215 220 

Gin Gin Ala Phe Trp Asn Ser Lys Gly Asn Tyr Val Val Ser Asn lie 
225 230 235 240 

Asn Gly Gly Glu Tyr Arg Ser Gly Lys Asp Ala Asn Ser lie Leu Ala 

245 250 255 

Ser lie His Asn Phe Asp Pro Glu Ala Gly Cys Asp Asn Leu Thr Phe 

260 265 270 

Gin Pro Cys Ser Glu Arg Ala Leu Ala Asn His Lys Ala Tyr Val Asp 

275 280 285 

Ser Phe Arg Asn Leu Tyr Ala lie Asn Lys Gly lie Ala Gin Gly Lys 

290 295 300 

Ala Val Ala Val Gly Arg Tyr Ser Glu Asp Val Tyr Tyr Asn Gly Asn 
305 310 315 320 

Pro Trp Tyr Leu Ala Asn Phe Ala Ala Ala Glu Gin Leu Tyr Asp Ala 

325 330 335 

lie Tyr Val Trp Asn Lys Gin Gly Ser lie Thr Val Thr Ser Val Ser 

340 345 350 

Leu Pro Phe Phe Arg Asp Leu Val Ser Ser Val Ser Thr Gly Thr Tyr 

355 360 365 

Ser Lys Ser Ser Ser Thr Phe Thr Asn lie Val Asn Ala Val Lys Ala 

370 375 380 

Tyr Ala Asp Gly Phe lie Glu Val Ala Ala Lys Tyr Thr Pro Ser Asn 
385 390 395 400 

Gly Ala Leu Ala Glu Gin Tyr Asp Arg Asn Thr Gly Lys Pro Asp Ser 

405 410 415 

Ala Ala Asp Leu Thr Trp Ser Tyr Ser Ala Phe Leu Ser Ala lie Asp 

420 425 430 

Arg Arg Ala Gly Leu Val Pro Pro Ser Trp Arg Ala Ser Val Ala Lys 

435 440 445 

Ser Gin Leu Pro Ser Thr Cys Ser Arg lie Glu Val Ala Gly Thr Tyr 

450 455 460 

Val Ala Ala Thr Ser Thr Ser Phe Pro Ser Lys Gin Thr Pro Asn Pro 
465 470 475 480 

Ser Ala Ala Pro Ser Pro Ser Pro Tyr Pro Thr Ala Cys Ala Asp Ala 

485 490 495 

Ser Glu Val Tyr Val Thr Phe Asn Glu Arg Val Ser Thr Ala Trp Gly 

500 505 510 

Glu Thr lie Lys Val Val Gly Asn Val Pro Ala Leu Gly Asn Trp Asp 



Thr Ser Lys Ala Val Thr Leu Ser Ala Ser Gly Tyr Lys Ser Asn Asp 

530 535 540 

Pro Leu Trp Ser lie Thr Val Pro lie Lys Ala Thr Gly Ser Ala Val 

545 550 555 560 

Gin Tyr Lys Tyr lie Lys Val Gly Thr Asn Gly Lys lie Thr Trp Glu 

565 570 575 

Ser Asp Pro Asn Arg Ser lie Thr Leu Gin Thr Ala Ser Ser Ala Gly 

580 585 590 

Lys Cys Ala Ala Gin Thr Val Asn Asp Ser Trp Arg 



<210> 4 
<211> 10739 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> pTrex3g_Nl3 plasmid 
<400> 4 

aagcttacta gtacttctcg agctctgtac atgtccggtc gcgacgtacg cgtatcgatg 60 

gcgccagctg caggcggccg cctgcagcca cttgcagtcc cgtggaattc tcacggtgaa 120 

tgtaggcctt ttgtagggta ggaattgtca ctcaagcacc cccaacctcc attacgcctc 180 

ccccatagag ttcccaatca gtgagtcatg gcactgttct caaatagatt ggggagaagt 240 

tgacttccgc ccagagctga aggtcgcaca accgcatgat atagggtcgg caacggcaaa 300 

aaagcacgtg gctcaccgaa aagcaagatg tttgcgatct aacatccagg aacctggata 360 

catccatcat cacgcacgac cactttgatc tgctggtaaa ctcgtattcg ccctaaaccg 420 

aagtgcgtgg taaatctaca cgtgggcccc tttcggtata ctgcgtgtgt cttctctagg 480 

tgccattctt ttcccttcct ctagtgttga attgtttgtg ttggagtccg agctgtaact 540 

acctctgaat ctctggagaa tggtggacta acgactaccg tgcacctgca tcatgtatat 600 

aatagtgatc ctgagaaggg gggtttggag caatgtggga ctttgatggt catcaaacaa 660 

agaacgaaga cgcctctttt gcaaagtttt gtttcggcta cggtgaagaa ctggatactt 720 

gttgtgtctt ctgtgtattt ttgtggcaac aagaggccag agacaatcta ttcaaacacc 780 

aagcttgctc ttttgagcta caagaacctg tggggtatat atctagagtt gtgaagtcgg 840 

taatcccgct gtatagtaat acgagtcgca tctaaatact ccgaagctgc tgcgaacccg 900 

gagaatcgag atgtgctgga aagcttctag cgagcggcta aattagcatg aaaggctatg 960 

agaaattctg gagacggctt gttgaatcat ggcgttccat tcttcgacaa gcaaagcgtt 1020 

ccgtcgcagt agcaggcact cattcccgaa aaaactcgga gattcctaag tagcgatgga 108 0 

accggaataa tataataggc aatacattga gttgcctcga cggttgcaat gcaggggtac 1140 

tgagcttgga cataactgtt ccgtacccca cctcttctca acctttggcg tttccctgat 1200 

tcagcgtacc cgtacaagtc gtaatcacta ttaacccaga ctgaccggac gtgttttgcc 1260 

cttcatttgg agaaataatg tcattgcgat gtgtaatttg cctgcttgac cgactggggc 1320 

tgttcgaagc ccgaatgtag gattgttatc cgaactctgc tcgtagaggc atgttgtgaa 1380 

tctgtgtcgg gcaggacacg cctcgaaggt tcacggcaag ggaaaccacc gatagcagtg 1440 

tctagtagca acctgtaaag ccgcaatgca gcatcactgg aaaatacaaa ccaatggcta 1500 

aaagtacata agttaatgcc taaagaagtc atataccagc ggctaataat tgtacaatca 1560 

agtggctaaa cgtaccgtaa tttgccaacg gcttgtgggg ttgcagaagc aacggcaaag 1620 

ccccacttcc ccacgtttgt ttcttcactc agtccaatct cagctggtga tcccccaatt 1680 

gggtcgcttg tttgttccgg tgaagtgaaa gaagacagag gtaagaatgt ctgactcgga 1740 

gcgttttgca tacaaccaag ggcagtgatg gaagacagtg aaatgttgac attcaaggag 1800 

tatttagcca gggatgcttg agtgtatcgt gtaaggaggt ttgtctgccg atacgacgaa 1860 

tactgtatag tcacttctga tgaagtggtc catattgaaa tgtaaagtcg gcactgaaca 1920 

ggcaaaagat tgagttgaaa ctgcctaaga tctcgggccc tcgggccttc ggcctttggg 1980 

tgtacatgtt tgtgctccgg gcaaatgcaa agtgtggtag gatcgaacac actgctgcct 2040 

ttaccaagca gctgagggta tgtgataggc aaatgttcag gggccactgc atggtttcga 2100 

atagaaagag aagcttagcc aagaacaata gccgataaag atagcctcat taaacggaat 2160 



gagctagtag gcaaagtcag cgaatgtgta tatataaagg ttcgaggtcc gtgcctccct 2220 

catgctctcc ccatctactc atcaactcag atcctccagg agacttgtac accatctttt 2280 

gaggcacaga aacccaatag tcaaccatca caagtttgta caaaaaagca ggctccgcgg 2340 

ccgccccctt caacatgcat accttctcca agctcctcgt cctgggctct gccgtccagt 2400 

ctgccctcgg gcggcctcac ggctcttcgc gtctccagga acgcgctgcc gttgatacct 2460 

tcatcaacac cgagaagccc atcgcatgga acaagctgct cgccaacatc ggccctaacg 2520 

gcaaagccgc tcccggtgcc gccgccggcg ttgtgattgc cagcccttcc aggacggacc 2580 

ctccttgtac gtggtggcat ggaatggacc caagagactg gttttagatg aaagagagtt 2 640 

tctgctaacc gccacaccca gacttcttca cctggacccg cgatgccgcc ctggtcctca 2700 

ccggcatcat cgagtccctt ggccacaact acaacaccac cctgcagacc gtcatccaga 2760 

actacgtcgc gtcgcaggcc aagctgcagc aggtctcgaa cccctcggga accttcgccg 2820 

acggctcggg tctcggtgag gccaagttca atgtcgacct cactgccttc actggcgaat 2880 

ggggtcgccc tcagagggac ggcccgcccc tgcgcgccat cgctctcatc cagtacgcca 2 940 

agtggctgat cgccaacggc tacaagagca cggccaagag cgtcgtctgg cccgtcgtca 3000 

agaacgatct cgcctacacg gcccagtact ggaacgagac cggcttcgat ctctgggagg 3060 

aggtccccgg cagctcgttc tttaccatcg ccagctctca caggggtgag tcatttattg 3120 

ttcagtgttt tctcattgaa taattaccgg aatgccactg acgccaaaca gctctgactg 3180 

agggtgctta cctcgccgct cagctcgaca ccgagtgccg cgcctgcacg accgtcgccc 3240 

ctcaggttct gtgcttccag caggccttct ggaactccaa gggcaactat gtcgtctcca 3300 

acagtaagat ccctacacca acaaaaaaaa tcgaaaagga acgttagctg acccttctag 3360 
t caacg 



